15 research outputs found
A Concurrency-Optimal Binary Search Tree
The paper presents the first \emph{concurrency-optimal} implementation of a
binary search tree (BST). The implementation, based on a standard sequential
implementation of an internal tree, ensures that every \emph{schedule}, i.e.,
every interleaving of steps of the sequential code, is accepted unless
linearizability is violated. To ensure this property, we use a novel read-write
locking scheme that protects tree \emph{edges} in addition to nodes.
Our implementation outperforms the state-of-the-art BSTs on most basic
workloads, which suggests that optimizing the set of accepted schedules of the
sequential code can be an adequate design principle for efficient concurrent
data structures.
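To make the edge-locking idea concrete, here is a minimal, insert-only sketch in Python. It is our illustration, not the paper's algorithm: plain mutexes stand in for the paper's read-write locking scheme, the class names are ours, and deletion (where most of the subtlety lies) is omitted.

```python
import threading

class Node:
    """Internal BST node. One lock per outgoing edge approximates the
    paper's idea of protecting tree edges, not just nodes (simplified:
    plain mutexes instead of reader-writer locks)."""
    def __init__(self, key):
        self.key = key
        self.left = None
        self.right = None
        self.left_lock = threading.Lock()   # protects the (self -> left) edge
        self.right_lock = threading.Lock()  # protects the (self -> right) edge

class EdgeLockedBST:
    """Insert-only sketch: traversal is lock-free; only the single edge
    being modified is locked, so disjoint inserts never contend."""
    def __init__(self):
        self.root = None
        self.root_lock = threading.Lock()   # protects the (sentinel -> root) edge

    def insert(self, key):
        with self.root_lock:
            if self.root is None:
                self.root = Node(key)
                return True
        node = self.root
        while True:
            if key == node.key:
                return False                 # key already present
            lock = node.left_lock if key < node.key else node.right_lock
            with lock:                       # lock only the edge we may modify
                child = node.left if key < node.key else node.right
                if child is None:            # edge still vacant: attach here
                    new = Node(key)
                    if key < node.key:
                        node.left = new
                    else:
                        node.right = new
                    return True
            node = child                     # edge occupied: keep descending

    def contains(self, key):
        node = self.root                     # reads take no locks at all
        while node is not None:
            if key == node.key:
                return True
            node = node.left if key < node.key else node.right
        return False
```

Because there is no deletion, a node observed during the unlocked descent can never disappear, which is what makes the lock-free traversal safe in this simplified setting.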
Parallel Combining: Benefits of Explicit Synchronization
A parallel batched data structure is designed to process synchronized batches of operations on the data structure using a parallel program. In this paper, we propose parallel combining, a technique that implements a concurrent data structure from a parallel batched one. The idea is that we explicitly synchronize concurrent operations into batches: one of the processes becomes a combiner which collects concurrent requests and initiates a parallel batched algorithm involving the owners (clients) of the collected requests. Intuitively, the cost of synchronizing the concurrent calls can be compensated by running the parallel batched algorithm.
We validate the intuition via two applications. First, we use parallel combining to design a concurrent data structure optimized for read-dominated workloads, taking a dynamic graph data structure as an example. Second, we use a novel parallel batched priority queue to build a concurrent one. In both cases, we obtain performance gains with respect to the state-of-the-art algorithms.
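The combining mechanism can be sketched as follows. This is our simplification, not the paper's construction: the combiner here applies the collected batch sequentially (as in classic flat combining), whereas parallel combining has the combiner enlist the waiting clients to run a parallel batched algorithm; `CombiningCounter` is a hypothetical example structure.

```python
import threading

class CombiningCounter:
    """Combining sketch: concurrent add() calls publish requests; one
    caller becomes the combiner, drains the request list, and applies
    the whole batch while the others wait to be notified."""
    def __init__(self):
        self.value = 0
        self.combiner_lock = threading.Lock()   # held by the current combiner
        self.requests = []                      # pending (amount, done-event) pairs
        self.requests_lock = threading.Lock()   # protects the request list

    def add(self, amount):
        done = threading.Event()
        with self.requests_lock:
            self.requests.append((amount, done))    # publish the request
        while not done.is_set():
            if self.combiner_lock.acquire(blocking=False):  # try to combine
                try:
                    with self.requests_lock:
                        batch, self.requests = self.requests, []
                    for amt, ev in batch:           # apply the whole batch
                        self.value += amt
                        ev.set()                    # release the request's owner
                finally:
                    self.combiner_lock.release()
            else:
                done.wait(0.001)                    # client: wait for a combiner
```

The point of the technique is that one traversal of the shared state serves many requests; in the paper's full scheme the batch itself is then processed in parallel by the waiting clients.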
The splay-list: A distribution-adaptive concurrent skip-list
The design and implementation of efficient concurrent data structures have
seen significant attention. However, most of this work has focused on
concurrent data structures providing good \emph{worst-case} guarantees. In real
workloads, objects are often accessed at different rates, since access
distributions may be non-uniform. Efficient distribution-adaptive data
structures are known in the sequential case, e.g., splay trees; however,
they are often hard to translate efficiently to the concurrent case.
In this paper, we investigate distribution-adaptive concurrent data
structures and propose a new design called the splay-list. At a high level, the
splay-list is similar to a standard skip-list, with the key distinction that
the height of each element adapts dynamically to its access rate: popular
elements ``move up,'' whereas rarely-accessed elements decrease in height. We
show that the splay-list provides order-optimal amortized complexity bounds for
a subset of operations while being amenable to efficient concurrent
implementation. Experimental results show that the splay-list can leverage
distribution-adaptivity to improve on the performance of classic concurrent
designs, and can outperform the only previously-known distribution-adaptive
design in certain settings.
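A sequential Python sketch of the adaptation rule (our illustration only: the paper's concurrent algorithm and its exact promotion threshold differ; here a node moves up one level once it holds more than a 2^-level fraction of all accesses, and new nodes always start at the bottom level):

```python
class SplayListSketch:
    """Sequential sketch of a distribution-adaptive skip-list: node
    heights grow with access frequency instead of being random."""
    MAX_LEVEL = 8

    def __init__(self):
        # head sentinel is linked at every level
        self.head = {'key': float('-inf'), 'next': [None] * self.MAX_LEVEL}
        self.total = 0          # total number of contains() calls so far

    def _find_preds(self, key):
        """Predecessor of `key` at every level."""
        preds = [None] * self.MAX_LEVEL
        cur = self.head
        for lvl in range(self.MAX_LEVEL - 1, -1, -1):
            while cur['next'][lvl] is not None and cur['next'][lvl]['key'] < key:
                cur = cur['next'][lvl]
            preds[lvl] = cur
        return preds

    def insert(self, key):
        # new elements enter at height 1 and earn height through accesses
        node = {'key': key, 'next': [None] * self.MAX_LEVEL, 'hits': 0, 'level': 1}
        preds = self._find_preds(key)
        node['next'][0] = preds[0]['next'][0]
        preds[0]['next'][0] = node

    def contains(self, key):
        self.total += 1
        preds = self._find_preds(key)
        node = preds[0]['next'][0]
        if node is None or node['key'] != key:
            return False
        node['hits'] += 1
        # promotion rule (simplified): a node holding more than a
        # 2^-level fraction of all accesses "moves up" one level
        lvl = node['level']
        if lvl < self.MAX_LEVEL and node['hits'] * (2 ** lvl) > self.total:
            node['next'][lvl] = preds[lvl]['next'][lvl]
            preds[lvl]['next'][lvl] = node
            node['level'] += 1
        return True
```

Demotion of rarely-accessed elements, and the lock-based concurrent version, are omitted; the sketch only shows how height can track access rate.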
Cytokine profile of patients with combined cardiac and ophthalmic pathology
Combined cardiac and ophthalmic pathology is highly prevalent in the older age groups of the population, and the two conditions share common pathogenetic mechanisms, which certainly include disturbances of the cytokine profile. However, the blood cytokine profile has scarcely been analysed in elderly patients with coronary heart disease combined with glaucoma. The aim of this study was to examine the cytokine profile of patients with combined cardiac and ophthalmic pathology. The study was carried out at the Tambov branch of the Academician S.N. Fyodorov Eye Microsurgery Complex (MNTK) in two groups: patients with coronary heart disease combined with glaucoma (n=58) and patients with coronary heart disease alone (n=49), both groups aged 60-74 years. Glaucoma was diagnosed according to the criteria of the National Glaucoma Guidelines; coronary heart disease was diagnosed using electrocardiographic, echocardiographic, radiographic, and enzyme studies. Plasma cytokine levels were measured on a Becton Dickinson FACS Canto 2 instrument (USA) with a CBA kit (BD Biosciences, USA). Significant differences were found between the age-matched groups for most cytokines, predominantly elevations in the patients with combined cardiac and ophthalmic pathology relative to the group with coronary heart disease alone. Plasma levels of IL-5, IL-12, IFN-γ, and TNF-α were significantly higher in patients with coronary heart disease combined with glaucoma than in patients with coronary heart disease alone. The largest increases among the cytokines examined were for IL-6 and IL-17, which reached 23.8±1.1 pg/ml and 20.2±1.7 pg/ml in the patients with combined cardiac and ophthalmic pathology versus 6.3±0.3 pg/ml and 7.9±0.5 pg/ml, respectively, in the patients with coronary heart disease. At the same time, the levels of IL-4 and IL-10 fell substantially, to 2.2±0.2 pg/ml and 6.4±0.4 pg/ml versus 4.8±0.3 pg/ml and 11.9±0.6 pg/ml. Logistic regression was used to estimate the relative risk associated with the studied blood cytokines and to build unadjusted and adjusted models, according to which the closest association with the risk of developing coronary heart disease combined with glaucoma was found for IL-6 and IL-17, with unadjusted relative risks of 2.87 and 2.71, respectively (p<0.001). In the adjusted model, the association of IL-6 with combined coronary heart disease and glaucoma rose to 2.92 (CI 2.80-3.27, p=0.004), while that of IL-17 fell to 2.64 (CI 2.51-2.85, p=0.003). Significant associations of IL-4, IL-5, IL-12, IFN-γ, and TNF-α with the combined disease were also established. The study demonstrated new associations between systemic cytokines and the risk of developing coronary heart disease combined with glaucoma.
Scalable belief propagation via relaxed scheduling
The ability to leverage large-scale hardware parallelism has been one of the key enablers of the accelerated recent progress in machine learning. Consequently, there has been considerable effort invested into developing efficient parallel variants of classic machine learning algorithms. However, despite the wealth of knowledge on parallelization, some classic machine learning algorithms often prove hard to parallelize efficiently while maintaining convergence. In this paper, we focus on efficient parallel algorithms for the key machine learning task of inference on graphical models, in particular on the fundamental belief propagation algorithm. We address the challenge of efficiently parallelizing this classic paradigm by showing how to leverage scalable relaxed schedulers in this context. We present an extensive empirical study, showing that our approach outperforms previous parallel belief propagation implementations both in terms of scalability and in terms of wall-clock convergence time, on a range of practical applications.
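Relaxed schedulers of this kind are typically MultiQueue-style structures; here is a minimal sketch (our illustration, assuming the classic two-choice deletion rule over several sub-heaps; in residual belief propagation the priority would be a message's residual):

```python
import heapq
import random

class RelaxedMultiQueue:
    """MultiQueue-style relaxed priority scheduler sketch: pop() inspects
    the tops of two randomly chosen sub-queues and removes the better
    one, so it returns an element *near* the global minimum rather than
    the exact minimum. Giving up strict priority order removes the
    single-heap bottleneck that limits scalability."""
    def __init__(self, num_queues=4, seed=0):
        self.queues = [[] for _ in range(num_queues)]
        self.rng = random.Random(seed)

    def push(self, priority, item):
        # inserts go to a random sub-queue (no global coordination)
        q = self.rng.choice(self.queues)
        heapq.heappush(q, (priority, item))

    def pop(self):
        # two-choice deletion: compare the tops of two random sub-queues
        nonempty = [q for q in self.queues if q]
        if not nonempty:
            return None
        a = self.rng.choice(nonempty)
        b = self.rng.choice(nonempty)
        q = a if a[0] <= b[0] else b
        return heapq.heappop(q)
```

In a concurrent implementation each sub-queue carries its own lock, so threads rarely contend; the sketch above keeps only the relaxation logic, not the synchronization.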
Provably and Practically Efficient Granularity Control
Over the past decade, many programming languages and systems for parallel computing have been developed, e.g., Fork/Join and Habanero Java, Parallel Haskell, Parallel ML, and X10. Although these systems raise the level of abstraction for writing parallel code, performance continues to require labor-intensive optimizations for coarsening the granularity of parallel executions. In this paper, we present provably and practically efficient techniques for controlling granularity within the run-time system of the language. Our starting point is "oracle-guided scheduling", a result from the functional-programming community showing that granularity can be controlled by an "oracle" that can predict the execution time of parallel code. We give an algorithm for implementing such an oracle and prove that it has the desired theoretical properties under the nested-parallel programming model. We implement the oracle in C++ by extending Cilk and evaluate its practical performance. The results show that our techniques can essentially eliminate hand-tuning while closely matching the performance of hand-tuned code.
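The oracle idea can be sketched as follows. This is our Python illustration, not the paper's C++/Cilk implementation: the grain size and the calibration scheme are assumptions, and the two recursive calls that a real runtime would fork in parallel run sequentially here.

```python
import time

GRAIN_US = 100  # target sequential grain in microseconds (assumed constant)

class Oracle:
    """Oracle-guided granularity control sketch: predict a task's run
    time as cost(n) * C, where C (seconds per unit of abstract cost) is
    calibrated from measured sequential runs. Tasks predicted to fit
    within the grain run sequentially; larger tasks are split."""
    def __init__(self, cost):
        self.cost = cost    # user-supplied abstract cost, e.g. n for a map
        self.c = None       # calibrated seconds per unit of cost

    def run(self, work, n, split):
        predicted = None if self.c is None else self.cost(n) * self.c
        if predicted is not None and predicted * 1e6 <= GRAIN_US:
            work(n)                          # predicted small: run sequentially
        elif n <= 1:
            start = time.perf_counter()      # base case: measure to calibrate C
            work(n)
            elapsed = time.perf_counter() - start
            self.c = elapsed / max(self.cost(n), 1)
        else:
            lo, hi = split(n)
            # a real runtime would fork these two calls in parallel
            self.run(work, lo, split)
            self.run(work, hi, split)
```

A usage example: `Oracle(cost=lambda n: n).run(do_chunk, 1000, lambda n: (n // 2, n - n // 2))` splits a 1000-element task until each piece is predicted to fit within the grain, eliminating the hand-chosen cutoff.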